Potential Information Maximization: Potentiality-Driven Information Maximization and Its Application to Tweets Classification and Interpretation
نویسندگان
چکیده
The present paper aims to apply a new informationtheoretic learning method called “potential information maximization” to the classification and interpretation of tweets. It is well known that social media sites such as Twitter play a crucial role in transmitting important information during natural disasters. In particular, since the Great East Japan Earthquake in 2011, Twitter has been considered as one of the most efficient and convenient communication tools. However, since there is much redundant information contained in tweets, it is critical that methods be developed to extract only the most important information from them. To cope with complex and redundant data, a new neural information-theoretic learning method has been developed for this purpose. The method aims to find neurons with high potential and maximize their information content to reduce redundancy and to focus on important information. The method was applied to real tweet data collected during the earthquake. It was found that the method could classify the tweets as important and unimportant more accurately than other conventional machine learning methods. In addition, the method made it possible to interpret how the tweets could be classified based on the examination of highly potential neurons.
منابع مشابه
Throughput Maximization for Multi-Slot Data Transmission via Two-Hop DF SWIPT-Based UAV System
In this paper, an unmanned aerial vehicle (UAV) assisted cooperative communication system is studied, wherein a source transmits information to the destination through an energy harvesting decode-and-forward UAV. It is assumed that the UAV can freely move in between the source-destination pair to set up line of sight communications with the both nodes. Since the battery of the UAV may be limite...
متن کاملA New GIS based Application of Sequential Technique to Prospect Karstic Groundwater using Remotely Sensed and Geoelectrical Methods in Karstified Tepal Area, Shahrood, Iran
In this research, recognition of karstic water-bearing zones using the management of exploration data in Kal-Qorno valley, situated in the Tepal area of Shahrood, has been considered. For this purpose, the sequential exploration method was conducted using geological evidences and applying remote sensing and geoelectrical resistivity methods in two major phases including the regional and local s...
متن کاملEnhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining
This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...
متن کاملUnification of Information Maximization and Minimization
In the present paper, we propose a method to unify information maximization and minimization in hidden units. The information maximization and minimization are performed on two different levels: collective and individual level. Thus, two kinds of information: collective and individual information are defined. By maximizing collective information and by minimizing individual information, simple ...
متن کاملRobust Method for E-Maximization and Hierarchical Clustering of Image Classification
We developed a new semi-supervised EM-like algorithm that is given the set of objects present in eachtraining image, but does not know which regions correspond to which objects. We have tested thealgorithm on a dataset of 860 hand-labeled color images using only color and texture features, and theresults show that our EM variant is able to break the symmetry in the initial solution. We compared...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016